Learning-based Image Enhancement for Visual Odometry in Challenging HDR Environments
نویسندگان
چکیده
One of the main open challenges in visual odometry (VO) is the robustness to difficult illumination conditions or high dynamic range (HDR) environments. The main difficulties in these situations come from both the limitations of the sensors and the inability to perform a successful tracking of interest points because of the bold assumptions in VO, such as brightness constancy. We address this problem from a deep learning perspective, for which we first fine-tune a Deep Neural Network (DNN) with the purpose of obtaining enhanced representations of the sequences for VO. Then, we demonstrate how the insertion of Long Short Term Memory (LSTM) allows us to obtain temporally consistent sequences, as the estimation depends on previous states. However, the use of very deep networks does not allow the insertion into a real-time VO framework; therefore, we also propose a Convolutional Neural Network (CNN) of reduced size capable of performing faster. Finally, we validate the enhanced representations by evaluating the sequences produced by the two architectures in several state-of-art VO algorithms, such as ORB-SLAM and DSO.
منابع مشابه
Visual Odometry in Smoke Occluded Environments
Visual odometry is an important sensor modality for robot control and navigation in environments when no external reference system, e.g. GPS, is available. Especially micro aerial vehicles operating in cluttered indoor environments need pose updates at high rates for position control. At the same time they are only capable to carry sensors and processors with limited weight and power consumptio...
متن کاملCombining Feature-Based and Direct Methods for Semi-dense Real-Time Stereo Visual Odometry
Visual motion estimation is challenging, due to high data rates, fast camera motions, featureless or repetitive environments, uneven lighting, and many other issues. In this work, we propose a twolayer approach for visual odometry with stereo cameras, which runs in real-time and combines feature-based matching with semi-dense direct image alignment. Our method initializes semi-dense depth estim...
متن کاملPlace Recognition in Semi-Dense Maps: Geometric and Learning-Based Approaches
For robotics and augmented reality systems operating in large and dynamic environments, place recognition and tracking using vision represent very challenging tasks. Additionally, when these systems need to reliably operate for very long time periods, such as months or years, further challenges are introduced by severe environmental changes, that can significantly alter the visual appearance of...
متن کاملThe Comparative Effect of Visual vs. Auditory Input Enhancement on Learning Non-Congruent Phrasal Verbs by Iranian EFL Learners
Vocabulary is one of the essential components of language and learning phrasal verbs as part of vocabulary is quite challenging for foreign language learners. The present study aimed at investigating the effects of visual and auditory input enhancement on learning non-congruent phrasal verbs. The participants of the study were 90 intermediate English language learners who were divided into two ...
متن کاملInterpreting the structure of single images by learning from examples
One of the central problems in computer vision is the interpretation of the content of a single image. A particularly interesting example of this is the extraction of the underlying 3D structure apparent in an image, which is especially challenging due to the ambiguity introduced by having no depth information. Nevertheless, knowledge of the regular and predictable nature of the 3D world impose...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1707.01274 شماره
صفحات -
تاریخ انتشار 2017